Multilingual Knowledge-Based Concept Recognition in Textual Data
نویسندگان
چکیده
With respect to the increasing volume of textual data which is available through digital resources today, the identification of the main concepts in those texts becomes increasingly important and can be seen as a vital step in the analysis of unstructured information. Research in this area has focused on the detection of named entities like person names or organization names, which only cover a very small part of concepts in texts. Especially the unique mapping between concepts in different languages requires parallel corpora, which are rarely available in industrial settings. We therefore propose a powerful new knowledge based model to recognize various kinds of concepts even in very short and specialized texts using linguistic information for synonym handling and word sense disambiguation. We evaluate the proposed model on texts from the automotive domain.
منابع مشابه
A multilingual text mining approach to web cross-lingual text retrieval
To enable concept-based cross-lingual text retrieval (CLTR) using multilingual text mining, our approach will first discover the multilingual concept–term relationships from linguistically diverse textual data relevant to a domain. Second, the multilingual concept–term relationships, in turn, are used to discover the conceptual content of the multilingual text, which is either a document contai...
متن کاملManaging Multimodal and Multilingual Semantic Content
With the advent and increasing popularity of Semantic Wikis and the Linked Data the management of semantically represented knowledge became mainstream. However, certain categories of semantically enriched content, such as multimodal documents as well as multilingual textual resources are still difficult to handle. In this paper, we present a comprehensive strategy for managing the life-cycle of...
متن کاملExploiting Knowledge Bases for Multilingual and Cross-lingual Semantic Annotation and Search
The amount of entities in large knowledge bases (KBs) has been increasing rapidly, making it possible to propose new ways of intelligent information access. In addition, there is an impending need for systems that can enable multilingual and cross-lingual information access. In this work, we firstly demonstrate X-LiSA, an infrastructure for multilingual and cross-lingual semantic annotation, wh...
متن کاملConceptual Modeling with Formal Concept Analysis on Natural Language Texts
The paper presents conceptual modelling technique on natural language texts. This technique combines the usage of two conceptual modeling paradigms: conceptual graphs and Formal Concept Analysis. Conceptual graphs serve as semantic models of text sentences and the data source for concept lattice – the basic conceptual model in Formal Concept Analysis. With the use of conceptual graphs the Text ...
متن کاملL2 Learners' Acquisition of English Nominal Clauses: Effects of Textual Enhancement, Metalinguistic Explanation, and Self-Regulation
This study aimed to investigate the impact of textual enhancement and metalinguistic explanation as focus-on-form tasks tending to encourage the acquisition of nominal clauses (NCs) in English. It explored (a) whether textual enhancement and metalinguistic explanation would promote and enhance the knowledge of NCs, (b) whether these two tasks would differ in terms of enhancing learners' knowled...
متن کامل